Evolutionary reinforcement learning via cooperative coevolutionary negatively correlated search

Abstract

Evolutionary algorithms (EAs) have been successfully applied to optimize the policies for Reinforcement Learning (RL) tasks due to their exploration ability. The recently proposed Negatively Correlated Search (NCS) provides a distinct parallel search behavior and is expected to facilitate RL more effectively. Considering that the commonly adopted neural policies usually involve millions of parameters to be optimized, the direct application of NCS to RL may face the great challenge of a large-scale search space. To address this issue, this paper presents an NCS-friendly Cooperative Coevolution (CC) framework to scale up NCS while largely preserving its search behavior. The issue that traditional CC can deteriorate NCS is also discussed. Empirical studies on 10 popular Atari games show that the proposed method can significantly outperform three state-of-the-art deep RL methods with 50% less computational time by effectively exploring a 1.7 million-dimensional search space.
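To make the idea of Cooperative Coevolution concrete, the following is a minimal sketch of a generic CC loop, not the NCS-friendly framework proposed in the paper. It assumes a toy `sphere` objective as a stand-in for a policy evaluation, random regrouping of variables every cycle, and a greedy accept-if-not-worse update per group. All function and parameter names here are hypothetical.

```python
import random


def sphere(x):
    """Toy objective to minimize (assumed stand-in for a negative episode return)."""
    return sum(v * v for v in x)


def cooperative_coevolution(objective, dim, n_groups, cycles, sigma=0.1, seed=0):
    """Generic CC sketch: split the variables into groups and improve one group at a time."""
    rng = random.Random(seed)
    # The context vector is the shared full solution that every group is evaluated against.
    context = [rng.uniform(-1.0, 1.0) for _ in range(dim)]
    best = objective(context)
    indices = list(range(dim))
    for _ in range(cycles):
        rng.shuffle(indices)  # random grouping, redrawn every cycle
        groups = [indices[g::n_groups] for g in range(n_groups)]
        for group in groups:
            # Perturb only this group's variables; all others come from the context.
            candidate = list(context)
            for i in group:
                candidate[i] += rng.gauss(0.0, sigma)
            score = objective(candidate)
            if score <= best:  # greedy acceptance: keep the candidate if it is not worse
                context, best = candidate, score
    return context, best


if __name__ == "__main__":
    solution, value = cooperative_coevolution(sphere, dim=20, n_groups=4, cycles=200)
    print(f"final objective: {value:.6f}")
```

Each subproblem here searches only a fraction of the variables, which is what lets decomposition-based methods work in very high-dimensional spaces. In an RL setting, each call to `objective` would correspond to one or more full policy rollouts.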

Similar resources

Cooperative Coevolutionary Ensemble Learning

A new optimization technique is proposed for classifiers fusion — Cooperative Coevolutionary Ensemble Learning (CCEL). It is based on a specific multipopulational evolutionary algorithm — cooperative coevolution. It can be used as a wrapper over any kind of weak algorithms, learning procedures and fusion functions, for both classification and regression tasks. Experiments on the real-world prob...

A Comparison of Evolutionary and Coevolutionary Search

We present a comparative study of an evolutionary and a coevolutionary search model. In the latter, strategies for solving a problem coevolve with training cases. We find that the coevolutionary model has a relatively large efficacy: 41 out of 50 (82%) of the simulations produce high quality strategies. In contrast, the evolutionary model has a very low efficacy: 1 out of 50 runs (2%) produce h...

Coevolutionary networks of reinforcement-learning agents

This paper presents a model of network formation in repeated games where the players adapt their strategies and network ties simultaneously using a simple reinforcement-learning scheme. It is demonstrated that the coevolutionary dynamics of such systems can be described via coupled replicator equations. We provide a comprehensive analysis for three-player two-action games, which is the minimum ...

Cooperative Inverse Reinforcement Learning

For an autonomous system to be helpful to humans and to pose no unwarranted risks, it needs to align its values with those of the humans in its environment in such a way that its actions contribute to the maximization of value for the humans. We propose a formal definition of the value alignment problem as cooperative inverse reinforcement learning (CIRL). A CIRL problem is a cooperative, parti...

Integrating System Optimum and User Equilibrium in Traffic Assignment via Evolutionary Search and Multiagent Reinforcement Learning

Traffic assignment is fundamentally a tool for transportation planning. It allocates trips within the traffic network. However, modern uses of traffic assignment also include shorter time horizons and even real-time use (e.g., for route recommendation). In the latter case, it is interesting to recommend routes that are as close as possible to the system optimum. To compute an approximation of t...

Journal

Journal title: Swarm and Evolutionary Computation

Year: 2022

ISSN: 2210-6502, 2210-6510

DOI: https://doi.org/10.1016/j.swevo.2021.100974